Automatic Speaker Recognition Using Gaussian Mixture Speaker Models

نویسنده

  • Douglas A. Reynolds
چکیده

• Speech conveys several levels of information. On a primary level, speech conveys the words or message being spoken, but on a secondary level, speech also reveals information about the speaker. The Speech Systems Technology group at Lincoln Laboratory has developed and experimented with approaches for automatically recognizing the words being spoken, the language being spoken, and the topic of a conversation. In this article we present an overview of our research efforts in a fourth area-automatic speaker recognition. We base our approach on a statistical speaker-modeling technique that represents the underlying characteristic sounds of a person's voice. Using these models, we build speaker recognizers that are computationally inexpensive and capable of recognizing a speaker regardless ofwhat is being said. Performance of the systems is evaluated for a wide range of speech quality; from clean speech to telephone speech, by using several standard speech corpora.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speaker Verification System Using Gaussian Mixture Model & UBM

In This paper presents an overview of a stateof-the-art text-independent speaker verification system. The objective of automatic speaker recognition is to extract, characterize and recognize the information about speaker identity. First, an introduction proposes a modular scheme of the training and test phases of a speaker verification system. Then, the most commonly speech parameterization use...

متن کامل

Text-Independent Speaker Recognition Using Gaussian Mixture Models Final Term Paper Proposal

The proposed project is an implementation of speaker recognition systems, both identification and verification. The systems are built using Gaussian Mixture Models, as proposed in several papers from Douglas A. Reynolds. The use of Fractional Covariance Matrix is studied as an possible increase for the traditional recognition systems. keywords: speaker recognition; Gaussian Mixture Models; like...

متن کامل

Forensic speaker recognition based on a Bayesian framework and Gaussian mixture modelling (GMM)

The goal of this paper is to establish a scientifically founded methodology for forensic automatic speaker recognition. The interpretation of recorded speech as evidence in the forensic context presents particular challenges. The means proposed in the paper for dealing with them is through Bayesian inference. This leads to the formulation of a likelihood ratio measure of evidence which weighs t...

متن کامل

Automatic Speaker Recognition for Forensic Case Assessment and Interpretation

Abstract Forensic speaker recognition (FSR) is the process of determining if a specific individual (suspected speaker) is the source of a questioned voice recording (trace). The forensic expert’s role is to testify to the worth of the voice evidence by using, if possible, a quantitative measure of this worth. It is up to the judge and/ or the jury to use this information as an aid to their deli...

متن کامل

Recognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model

Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....

متن کامل

Speaker recognition from coded speech in matched and mismatched conditions

We investigate the effect of speech coding on automatic speaker recognition when training and testing conditions are matched and mismatched. Experiments use standard speech coding algorithms (GSM, G.729, G.723, MELP) and a speaker recognition system based on Gaussian mixture models adapted from a universal background model. There is little loss in recognition performance for toll quality speech...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1995